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Abstract. The stochastic block model is a natural model for studying community detection in 
random networks. Its clustering properties have been extensively studied in the statistics, physics 
and computer science literature. Recently this area has experienced major mathematical break¬ 
throughs, particularly for the binary (two-community) version, see |26[ 1271121| . In this paper, we 
introduce a variant of the binary model which we call the regular stochastic block model (RSBM). 
We prove rigidity by showing that with high probability an exact recovery of the community struc¬ 
ture is possible. Spectral methods exhibit a regime where this can be done efficiently. Moreover we 
also prove that, in this setting, any suitably good partial recovery can be bootstrapped to obtain a 
full recovery of the communities. 


1. Definition of the model and main results 

The stochastic block model (SBM) is a classical cluster-exhibiting random graph model that has 
been extensively studied, both empirically and rigorously, across numerous fields. In its simplest 
form, the SBM is a model of random graphs on 2n nodes with two equal-sized clusters A and 
B such that |A| = \B\ = n and A H B = 0. Edges between various pairs of vertices appear 
independently with probability p = pn ii the two vertices belong to the same cluster and with 
probability q = qn otherwise. Thus, for any vertex, the expected number of same-class neighbors 
is a := Qn := p{n — 1) ~ pn, and the expected number of across-class neighbors is b := bn := qn. 

Given a realization of the graph, the broad goal is to determine whether it is possible (with high 
probability) to hnd the partition A, B; and if the answer is yes, whether it is possible to do so 
using an efficient algorithm. Otherwise, the best one can hope for is the existence of an algorithm 
that will output a partition which is highly (or at least positively) correlated with the underlying 
cluster. To this end, consider the space Ai of all algorithms which take as input a hnite graph 
on 2n vertices and output a partition of the vertex set into two sets. Informally, we say that an 
algorithm in A4 allows for weak recovery if, with probability going to 1 as n goes to infinity, 
it outputs a partition such that |AAA| -|- |BAR'| = o{n) (here A denotes the symmetric 

difference). We say that an algorithm allows for strong recovery if, with probability going to 
1 as n goes to infinity, it outputs the partition {A,B). Finally, an algorithm in Ai will be called 
efficient if its run time is polynomial in n. 

The problem of community detection described above is closely related to the min-bisection 
problem, where one looks for a partition of the vertex set of a given graph into two subsets of equal 
size such that the number of edges across the subsets is minimal. In general, this problem is known 
to be NP-hard |12j : however, if the min-bisection is smaller than most of the other bisections, the 
problem is known to be simpler. This fact was noticed a few decades ago, with the advent of the 
study of min-bisection in the context of the SBM. In particular. Dyer and Frieze [9] produced one 
of the earliest results when they showed that ii p > q are fixed as n —>■ oo then the min-bisection is 
the one that separates the two classes, and it can be found in expected O(n^) time. Their results 
were improved by Jerrum and Sorkin [T^ and Condon and Karp [7]. Each of these papers were 
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able to find faster algorithms that worked for sparser graphs. The latter work was able to solve the 
min-bisection problem when the average degrees were of order 

Until a few years ago most of the literature on both the min-bisection problem and community 
detection in the SBM had focused on the case of increasing expected degrees (i.e. a, 6 —>■ oo as 
n —)• oo), with the best results at that time showing that if the smallest average degree is roughly 
logn, then weak recovery is possible (e.g., McSherry [25] showed that spectral clustering arguments 
can work to detect the clusters in this setting). Recently, the sparse case, i.e. when a, 6 = 0(1) has 
been the focus of a lot of interest. This regime is interesting both from a theoretical and an applied 
point of view since a lot of real world networks turn out to be sparse; for more on this see [TH|. Coja- 
Oghlan demonstrated a spectral algorithm that finds a bisection which is positively correlated with 
the true cluster when the average degree is a large constant |B]. Using ideas from statistical physics, 
Decelle, Krzakala, Moore and Zdeborova gave a precise prediction for the problem of recovering a 
partition positively correlated with the true partition in the sparse SBM [8|. The prediction was 
rigorously confirmed in a series of papers by Mossel, Neeman and Sly [261 [27|, and Massoulie [21], 
where it was shown that this level of recovery is possible iff (a — 6)^ > (a + 6). More recently, 
|28j found necessary and sufficient conditions for a and b under which strong recovery is possible. 
Before them, Abbe, Bandeira and Hall [T] also characterized strong recovery assuming the edge 
probabilities to be constant factors of 

In [26] Mossel, Neeman and Sly proposed two regular versions of the SBM in a sparse regime, 
and they conjectured thresholds for the recovery of a correlated partition for each of the models. 
They also suggested that spectral methods should help to differentiate between the regular SBM 
and a random regular graph. In this article we study a slightly different version of a regular SBM 
where in addition to the graph being regular, the number of neighbors that a vertex has within its 
own community is also a constant. Formally, we have the following definition. 

Definition 1. For integers n,di and d 2 denote by G{n,di,d 2 ), the random regular graph with 
vertex set [2n], obtained as follows: Choose an equipartition (parts have equal sizes) {A,B) of the 
vertex set, uniformly from among the set of such equipartitions. Choose two independent copies of 
uniform simple di-regular graphs with vertex set A, respectively B. Finally, connect the vertices 
from A with those from H by a random fi 2 -bipartite-regular graph chosen uniformly. We refer to 
this family of measures on graphs as the regular stochastic block model (RSBM). 

The goal of this article is to investigate the similarities and differences between the RSBM and 
the classical SBM. For the rest of the article we assume that min{di,d 2 } > 3. This assumption 
implies that, with high probability, the resulting graph is connected. This differs from the SBM 
with bounded average degree, which has a positive density of isolated vertices, which make strong 
recovery impossible. The constant degree of all the vertices in the RSBM makes the local neighbor¬ 
hoods easier to analyze; however, as this model lacks the edge-independence present in the SBM, 
some computations become significantly more difficult. 

Throughout the rest of the article we say a sequence of events happen asymptotically almost 
surely (a.a.s.) if the probabilities of the events go to 1 along the sequence. The underlying measure 
will be always clear from context. 

Our first result, the next proposition, pertains to the rigidity of RSBM; it says that the RSBM 
is asymptotically distinguishable from a uniformly chosen random regular graph with the same 
average degree. Below, H-, -Wtv denotes the total variation distance between measures. 

Proposition 1. Let be the measure induced by Q{n,di,d 2 ) on the set Reg(2n,di -|- ^ 2 ) of 
all (di -|- d 2 )-regular graphs on 2n vertices and let be the uniform measure on the same set 
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Reg(2n, di + ^ 2 )- Then for any positive integers ^ 1,^2 > 3, 

lim = 1. 

n^oo 

This result sharply contrasts the RSBM and the SBM (which is indistinguishable from an Erdos- 
Renyi random graph with the same size and average degrees satisfying (a — 6)^ < (a + 6) [26]). 

In order to determine whether it is possible to recover the partition in the RSBM, one must 
first answer a basic question about the random graph G{n,di,d 2 )- is the ‘true partition’ {A,13) 
identifiable. I.e., is {A, B) the only way to partition the graph such that the subgraphs on the parts 
are di-regular (which then implies that the subgraph across is (i 2 -bipartite)? The following result 
shows that the answer is yes if di and ^2 are sufficiently large. 

Theorem 1. There exists a constant d' > 0 such that, for di > d 2 > d', Q{n,di,d 2 ) has a unique 
partition a.a.s. 

The particular value of d' that we get is far from optimal; we conjecture that the conclusion of 
this theorem should be true for d' = 2. The proof of Theorem is quite technical and is given in 
section [3l 

To our knowledge, this is the first uniqueness of partition result for block models with constant 
degrees. Such a result is not true, however, in the classical setting where the edges are independent, 
since with constant probability one has isolated vertices. 

If the original partition is unique in most cases then one can, in principle, find the original 
partition by exhaustive search, and hence achieve strong recovery. This is again in sharp contrast 
with the SBM, where strong recovery is achievable only in the case of growing degrees. 

The next natural direction is to look for an efficient algorithm for strong recovery. While we 
do not answer this question in general, we do exhibit one regime where such an algorithm exists. 
The nature of the algorithm is essentially spectral. We believe that the regime defined below is the 
largest one can cover using properties of the graph’s spectrum. 

Theorem 2. Assume {di — ^ 2 )^ > 4(di + d 2 — 1). Then there is an efficient algorithm that allows 
strong recovery. 


Remark 1.1. In the case that di is even our graph is a special case of the so called “random lifts” 
and we can use their spectral properties (see Section 2.3). We will give the proof of Theorem]^ 
when di is even in the main body of the text and postpone the proof for any value of di to the 
appendix. 


The proof of the above theorem is broken into two parts. The first part uses a spectral argument 
to prove weak recovery. Formally we have the following lemma: 

Lemma 1. Assume {di — (^ 2 )^ > ^{di + d 2 — 1). Then there is an efficient algorithm that allows 
weak recovery. 

Lemma gives us weak recovery. Strong recovery is then achieved by recursively applying the 
majority algorithm where one simultaneously updates the label of each vertex by the majority label 
among the neighbors. That this can be done is again an example of the rigidity in this model, and 
highlights one of the main differences between RSBM and the classic SBM. It shows that for the 
former, existence of an efficient algorithm for weak recovery implies the existence of an algorithm 
for strong recovery. This contrasts with the separate thresholds in the SBM |28j . 

We present the majority algorithm in the section below. 
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1.0.1. Majority algorithm. Recall that A and B are the true communities. Let {A,B) be any 
partition (not necessarily an equipartition) of the vertex set. For each i G [2n], let cTj = +1 if i G ^ 
and (Tj = — 1 if i G B. 

Initialize = A,Bq = B. 

For i G [2n] (majority rule) 

(Ti = sign( aj) 

Vjr^Vi 

Return Ai = {vi : Oi = +1}, Bi = {vi : di = -1} 

Similar applications of the majority algorithm appear in [1] and [28j. There, the authors find 
criteria for both weak recovery and strong recovery in the SBM. It is not hard to see that weak 
recovery and strong recovery are not equivalent in the sparse SBM, since the presence of isolated 
vertices prevents strong recovery. 

Throughout the rest of the article we will refer to the majority algorithm as Majority. The 
following theorem, along with Lemma completes the proof of Theorem 

Theorem 3. Assume di > (i2+4. Then there exists an e = e{di) > 0 such that the following is true 
a.a.s.: given a graph ^(n, di, ^ 2 ) and any partition (A, B) of its vertex set such that | AnA.| > (1—e)n 
and |i?ni3| > (1 — e)n. Majority recovers the true partition (A, B) if started with {A,B), after 
0(log(n)) iterations. The constant in the O(-) depends on e,di. 

The way we iterate the Majority algorithm will be clear from the proof of Theorem see 
section 15.11 


2. Main ideas and organization of the paper 
In this section we sketch the main ideas behind the proofs and also the structure of the paper. 


2.1. Organization. There are five results we present in this paper. In section]^ we prove Propo¬ 
sition [1] and Theorem [H This section also contains a review of some standard definitions in the 
random graph literature that we make use of throughout the paper. We present an informal sketch 
of the proof of Theorem in section |2.2[ Section is concerned with proving Theorem when di 
is even. Section contains the proofs of Theorem as well as Theorem with no restriction on the 
parity of di. The proof of Lemma is deferred to the Appendix. Finally, we introduce some useful 
notions on random lifts and multigraphs in section 2.3, where we explain how to obtain Theorem 
[^when di is even. 


2.2. Sketch of the proof of Theorem[^ Recall from Definition]^ in the graph G := Q{n, ^ 1 ,^ 2 ) 
on [2n], (A, B) form the true partition. 

Let us introduce the following notation: for any V C [2n] let Gy denote the subgraph induced 
by G on V. For disjoint subsets Vi, V 2 , let denote the subgraph on Vi U V 2 induced by the 

edges in G with one endpoint in Vi and the other in V 2 . For any v G [2n] and V C [2n] let degv{v) 
denote the number of edges incident on v whose other endpoint is in V. 

Thus Theoremsays that, a.a.s., there does not exist any V C [2n] with R / A, B and \V\ = n 
such that the following two conditions hold simultaneously: 

• Both Gy and G[ 2 n]\y di-regular graphs. 

• G'(y,[2n]\y) is a d2-Legular bipartite graph. 

However we show that it is even unlikely that Gy is di-regular for any R A A, B with |R| = n. To 
this end we fix such a R and let Ri := Rn A, R 2 := R nB, and assume IR 2 I = an with a < ^. Note 
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that, given G, V and A, the degree sequence {degvi{v)}y^Vi is determined; if Gy were di-regular 
graph then for each v € V, 

degvAv) + degv2{v) = di, 


and hence the degree sequence {degv 2 {v)}v&V\ is also determined, i.e. the number of edges going 
from each vertex in Vi to V 2 is fixed. 

It can be shown using the configuration model (see Section 3.2 for the dehnition) that the joint 
distribution of {degv2{v)}veVi behaves like i.i.d. Bin{d2, a)'s. The proof now follows by using the 
above to estimate the probability of a certain degree sequence from this distribution, and by a 
union bound over all possible choices of V. We remark that the formal proof involves some case 
analysis depending on the size of | V 2 1 and relies on the expansion properties of regular graphs when 
IV 2 I is small. 


2.3. Sketch of the proof of Theorem when di is even. To prove Theorem when di is 
even, we make use of the recent work on the spectra of random lifts of graphs in mu and the 
references therein. For a wonderful exposition of lifts of graphs see [2]. We now introduce the 
notion of lift of a multigraph. 


2.3.1. Random lifts and multigraphs. By a multigraph we simply mean a graph that allows for 
multiple edges and loops. Next we define the notion of lift. Informally, an n-lift of a multigraph 
X = {V,E) is a multigraph Xn = {Vn,En), such that for each vertex in V there are n vertices in 
Vn and locally both graphs look the “same”. Formally, let Vn '■= V x {1,2,.. .n}. To define the 
edge set in the lift consider the set := {vrejeeE where vie G Sn (the set of permutations of [n]). 
We have: 


En := {((a:,f), (2/,7re(f))) : e = {x,y) £ E, 1 <i<n}, 

for vr G . Thus every edge in E “lifts” to a matching in En- For every v G V, let v x {1, 2,... n} 
be called the fiber of v. 

A random lift is the lift constructed from vr G 5® where {'Kgfie&E are chosen uniformly and 
independently from Sn- Let A and be the adjacency matrices of the multigraphs X and 
respectively. One can check that all the eigenvalues of A are also eigenvalues of A^ and the 
corresponding eigenvectors can be “lifted” as well to an eigenvector (which is constant on fibers) 
of the lifted graph. Let the remaining eigenvalues of An be, 

\hl\ > \h2\ > ■■■> \gr\, ( 2 . 1 ) 

where r = n\V\ — |B|. With the above definitions we now state one of the main results in |llj . 

Theorem 4. Let d > 3 be an integer and let X be a finite, d-regular multigraph. If Xn is a random 
n-lift of X then, for any e > 0, 

lim P(|^i| > 2\/d — 1 -|- e) = 0 . 

n^oo 

Recall the definition of strong and weak recovery from Section We also need the following 
definition. 


Definition 2. Let e := e 2 n be the vector of all ones of length 2n. Also let a = a 2 n be the vector 
of signs which denotes the partition A, B i.e. 


a{x) 


“hi X G 

— 1 otherwise. 
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The proof of Theoremfollows by first realizing the graph Q{n, di, ^ 2 ) as a random lift and then 
using the above theorem to show spectral separation of moreover, it can be shown that, with 
high probability, a in Definition is an eigenvector associated to the second eigenvalue of the lift. 
The proof of Theorem is now reduced to finding a good approximation to the unitary eigenvector 
corresponding to the second eigenvalue. Note that this allows the strong recovery of the partition 
{A,B). 


3. Proof of Proposition [T] and Theorem [H 

Let Kn be the support of fin, i-e., Kn is the set of all graphs which are di-regular on A and B and 
d 2 -regular and bipartite across, for some equipartition [A,B) of [2n]. Let \Q{n,d)\ be the number 
of d-regular graphs on n labelled vertices and let \BQ{n,d)\ be the number of d-regular bipartite 
graphs on 2n vertices. To show that —)• 0 we will use the following enumeration results 

that can be deduced from [22] and [23| . The idea is to count the number of points in the support 
of the measures and We have from |23l Corollary 5.3] : 

{nd)l 


\G{n,d)\ ^f^nd/ 2 )l 2 ^'iA{d\)^ ’ 

asymptotically in n, where C = C{n, d) remains bounded as n grows. Similarly, from 
2 ]: 

{dn)\ 


\Bg{n,d)\=Ci 


id 


y]2n 


(3.1) 
Theorem 

(3.2) 


asymptotically in n, for Ci = Ci{n,d) a bounded function. We have: 


f^n{dAn) — 


\K^ 


\G{2n, di + d 2 )\ 

To compute |iLn|, recall Definitionfirst choose A and then use ( |3.1| ) and (3.2). We get: 


r. 


{ndi)\ 


{nd 2 )\ 


nj \{ndi/2)\2^'^iA{diyAJ (d2)P^ 

(n(di + d 2 ))! 2 »(^i+'^^)(di + d 2 )!^^ 

(2n(di + d 2 ))! 

for C 2 = 6 * 2 (n, di, ^ 2 ) bounded as n grows. Using Stirling’s Formula we get: 



_|_ (^^jdi+d2 




2di+d2 J _|_ (^^yi+d2 J 


Where C 3 equals C 2 times a universal constant. Both fractions on the right hand side above are 
less than 1. This proves Proposition □ 


3.1. Uniqueness of the clusters. 


3.2. Preliminaries. For the sake of completeness, we include in this section some of the basic 
definitions in the random graph literature. Specifically, we define the conhguration model to sample 
random graphs and also the exploration process. 
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3.2.1. Configuration model and exploration process. The configuration model, introduced by Bender 
and Canfield [3] and made famous by Bollobas jl] , is a well known model to study random regular 
graphs. Assuming that dn is even, the conhguration model outputs a d-regular multigraph with n 
vertices. This is done by considering an array {^ij, i<i<d, l<j<n} and choosing a perfect 
matching of it, uniformly among all possible matchings. A graph on n vertices is obtained by 
collapsing all ^ij for 1 < i < d into a single vertex, and putting and edge between two vertices j 
and t for each pair {^ij,^ki) present in the matching. We refer to the family ^ij as half edges. 

It is not hard to see that under the condition that the resulting graph is simple, the distribution 
of the graph is uniform in the set of all simple d-regular graphs. Furthermore, it is well known that, 
for any hxed d, as n grows to infinity, the probability that a graph obtained by the conhguration 
model is simple is bounded away from zero. More precisely, denoting by G the resulting graph, one 
has (see 0), 

l-d2 

P(G is simple) = (1 — o(l))e 4 . 

Thus, to prove a.a.s. statements for the uniform measure on simple d-regular graphs it suffices 
to prove them for the measure induced on multigraphs by the conhguration model. 

One extremely useful property of this model is the fact that one can construct the graph by expos¬ 
ing the vertices one at a time, each time matching one by one the d half edges of the correspondent 
vertex, to a uniformly chosen half edge among the set of unmatched half edges. This process will 
be used crucially in many of the estimates. We include the precise dehnition for completeness. 

Definition 3. Consider the following procedure to generate a random d-regular graph on n vertices: 

— Fix an order of the vertices: vi < V 2 < ... < Vn and let E = {fij}, 1 < i < d and 1 < j < n, 
be the set of half edges, where, for any 1 < j < n, f^ij are the d half edges incident to vertex 
Vj. Consider the usual lexicographic order on H. 

— Construct a perfect matching of E as follows: the first pair is where ^ is chosen 

uniformly from Having constructed k pairs, let be the smallest half edge not 

matched yet, chose ^ uniformly from the set of remaining unmatched half edges different 
from ^ij , and add the edge {(,ij , ^). 

— Output a multigraph G, with vertex set {uj} and an edge set induced by the matching 
constructed in the previous step. 

This construction outputs a graph with the same law as the one given by the configuration model. 
Conveniently, with this construction we discover all neighbors of vertex m first, then we move to 
V 2 and expose its neighbors (it could be the case that some edges are connecting vi and V 2 and 
those were exposed before!) and so on. We will refer to this procedure as the exploration process. 
All the above dehnitions can be easily adapted to sample bipartite regular graphs as well, and in 
this paper we will use both sets of definitions. 

3.2.2. Proof of Theorem^ Recall that di > d 2 and that {A,B) are the true clusters. The idea, as 
discussed in Section will be to show that, conditioned on the choices of A and B, if we choose 
another subset of n vertices, the probability of having a di-regular graph on these n vertices is 
small. The estimate on the above probability is crucial since it will then allow us to take a union 
bound over all possible subsets of size n to conclude that, a.a.s., there is a unique pair of clusters. 

First we need some definitions. 

Definition 4. Given a graph G = {V, E), 

i. For a vertex v and a set of vertices S denote by degs{v) the number of neighbors of v in S. 
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ii. For any subsets Vi C V 2 C V define the boundary dy^Vi to be the number of edges in E 
whose one end point lies in Vi and the other in V 2 \Vi. When V 2 = V we use the simpler 
notation dVi. 

Consider non-empty subsets A C A, B C B such that U -B| = n. Without loss of generality 
assume |^| > |-B| and let a be such that 

an = \B\. (3.3) 

We will prove Theorem by showing that given the di-regular graph with vertex set A, for any 
choice of A and B the probability that is a di-regular graph goes to zero as n goes to infinity. 

We use the simple observation that since A is di-regular, to have Au B di-regular, for any vertex 
V £ A, the number of neighbors of u in i? must be equal to the number of neighbors of v in 
The technical core of the proof involves showing that the probability of this event is small. 

We start by proving a lemma. Recall that, in order to have a di-regular graph with vertex set 
AuB with A C A and R C it is necessary that degsiv) = degx\A{v) for all v € A. For notational 
brevity let 

gy := degjs,\A{v) (3.4) 

for all V ^ A. 


Lemma 2 . Given A d A,B <Z B and a sequence of non-negative numbers g = {gi, g2, ■ ■ ■, g\A\) let 

p{gi,g2, ■ ■ ■,g\A\) ■= ^{degB{v) = gv for all v ^ A). 

Then, for any such g, 

maxp(5'i, 52> • ■ ■; A|) = > • • ■; 5|A| )> 

where g* G {£, -|- 1} for some non negative number i = i{g). The maximum in the above is taken 
over all sequences g' = {g[, g'^,. .., 5 |^|) such that X]l=i g'i = E 1 =i 

The above lemma says that, given the total number of edges going from A to B, the probability 
of a possible degree sequence is maximized when all the degrees are essentially the same. Clearly 

; the number of (/ -|- 1 ) degrees occurring in 5 * = {gl,g 2 ,. ■ ■ ,g*A\) is determined by 
E^g*i =Eigi- 


I = 


l^l 


Proof. To compute p{gi,g 2 , ■ ■ ■ ■,g\A\) we use the exploration process for the ^ 2 —regular bipartite 
graph {A,B) where the vertices of A are exposed one by one, as sketched in Subsection |3.2.1 We 
order the vertices so that the vertices of A are exposed first. Let T) be the filtration generated 
by the process up to the vertex. Using the exchangeability of the variables degsivi), given a 
sequence {gi}, w.l.o.g. we can assume gi = min^j and 52 = max^j. 

Assume now 52-51 > 1- We will show that ^( 51 , 52 ,.. ■,g\A\) < p{gi + 1,52 - 1, • ■ -,51^1), which 
implies the lemma. We start with the following simple observation: 


F{degB{vi) = gi, i > 3 | F 2 ,degB{vi) = 51 , degB{v 2 ) = 52 ) = 

F{degB{vi) = 5 *, f > 3 I E2,degB{vi) = 51 + l,degB{v2) = 52 - !)• 


This is because under the above two conditionings the number of remaining unmatched half edges 
in A, A, B, B is the same. Hence it suffices to show that 


F{degB{vi) = gi,degB{v 2 ) = 52 ) < F{degB{vi) = gi + l,degB{v 2 ) = 52 - !)• 


(3.5) 
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Next we note that 


F{degBivi) = gi,degB{v 2 ) = 52 ) 


/ d 2 \ fd 2 \ (anrf 2 )[gi+g 2 ]((l - «) 11-^2)[2^2-31-92] 
\9i) \92) (.rid2)[2d2] 


where {x)m is the falling factorial (x)[m] = x{x — 1)... (x —m + 1). To see the above, we first choose 
those half edges of vi and V 2 that will connect to half edges in B. Then we choose the 2^2 half 
edges in B that will match with the corresponding half edges of xi and V 2 such that exactly gi + g 2 
are incident on vertices in B. 

Substituting now into (|3.5[) we have: 


p{9i, 92, ■■■9\A\) < P{9i + 1,52 



(51 + l)(c^2 - 52 + 1) < 52(^2 - 5i) 


51 - 52 + 1 < ^ 2(52 - 51 - 1) 


which follows immediately from g 2 >51 + 1 . 


□ 


Recall that we are interested in the probability that ^ U R is di-regular for a hxed choice of A 
and B. As already discussed, 


P(AU B is di-regular) < F{degjw^ji{v) = degB{v),'iv G A). (3.6) 

Our next goal is to bound the probability of such an event. To this end we recall the notion of 
stochastic dominance. 

Let ui and V 2 be two probability measures on Z, and let X ~ ui, T ~ V 2 . We use X ^ T to 
denote that i >2 stochastically dominates ui. 

Recall now Dehnitions and as well as (3.4). 


Lemma 3. Let M = min{c?^A, n/2}, and let Y = (Ti, 12, ■ • •, Lm) where Yi ~ Bin{d 2 , 2 a) are 
i.i.d.. Then 

M 

F{degBiv) = g^ , Vv e A \ A) < ]^IP(Tj > 1) • 

i=l 


For notational brevity, we have denoted by P(- | A) the random graph measure Q{n,di-,d 2 ) 
conditioned on the subgraph induced by A. 


Proof. First recall that by Lemma the quantity on the left hand side is maximized when for all 
V, gy G {£, t'+ 1}. Hence we assume that this is the case. Now to prove the lemma we consider the 
exploration process dehned above. The definition requires us to fix an order on the vertices of A] 
we do this in the following way. Consider the two cases: 

i. £ = 0 : First come all the vertices u, G A with gi = 1, followed by the remaining vertices in 
A. Then come all the vertices in A.\A. 

ii. £> 0 : First come all the vertices Xj G A with gi = £, followed by the remaining vertices in 
A. Then come all the vertices in A\A. 

Recall that T) is the filtration up to vertex i. Note that, for 1 < i < min(5_4A, n/2), 

and2 — {i — 1) 
nd2 —id2 

This follows from the simple observation that for any of the cases mentioned above for the 
vertex, there are at most {and 2 — (i — 1)) half edges in B that haven’t yet been matched. Now note 



degBivi)\Bi-i < Bin[d2, 
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that since by hypothesis i f, 


and2 — (i — 1 ) ^ and2 


nd2 — id2 nd2j2 

= 2a. 

Thus we are done. □ 

As already used in the proof of the above lemma, 

P(A U is di-regular | A) < p{i, + 1, + 1) 

for some i = (.{A, A). In case i. we see that by Lemma 

min{n/2,5^A} 

p(0,0,...,0,l,...,l)=p(l,l,...,l,0,...,0) < n P(yi>l) (3.7) 

i=\ 

min{n/2,5^A} 

< ll {2d2a) 

i=l 

The first equality follows by exchangeability. The first inequality follows from Lemma The 
second is a simple consequence of the fact that for a nonnegative variable the probability of being 
bigger than 1 is at most its expectation. 

In case ii by similar arguments 

n/2 

p{£A,. ..£,£ +I,...,£+1) < > 1) (3.8) 

i=l 

nl2 

< ]^(2d2a)- 

i=l 


Note that in (3.8) the term dj[A does not appear. This is because in this case by hypothesis 


n 


\dAA\>£\A\>-. 

To proceed with the proof of Theorem we quote two standard results on the expansion of random 
d-regular graphs. Let 7 be the spectral gap for the operator of the random walk in the uniform 
random regular graph G G G{n,d), be.: 

7 = 1-^ (3.9) 

where A 2 is the second largest eigenvalue of the adjacency matrix of G. 

Theorem 5. [IHl Theorem 1.1] With probability going to 1 as n —>■ 00 , 

The next result was proven independently in m and m- We will use it as it appears in [T9l 
Theorem 13.14]. 


Theorem 6. Let G be a d-regular graph in n vertices. For any S C V{G), with IS"! < 

2 - d\S\' 
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Putting everything together we get the following: For di > 16, a.a.s., for all S' C ^ with \S\ < 

> ^\s\. 


In particular since |^| > n/2 it follows that, a.a.s.. 


di, 


In case i. {£ = 0) plugging ( |3.10 ) in (3.7) we get 

min(n/2,|5_4A|) 


(3.10) 


\Sa^\ 

2 


U B is di-regular I < 


n 

i=l 


) > 1) < n ^ 


(3.11) 


2=1 


8 


< n ^ 
2 = 1 


assuming that the di-regular graph on A satisfies (3.10). The second inequality follows from the 
simple observation that since £ = 0, we have \dA\ < n. 

Recall that we want an upper bound on the right hand side of 3.6 Combining Lemma ( |3.8[ ) 
and (3.11) we get 

P(A U R is di-regular I ^) < P(y > l)^"*"-h P(y > 1)’"/^. (3.12) 

The two terms on the right hand side correspond to the two cases (. = Q and ^ > 1. 

Next we show that the bounds in ( 3.12[ ) are good enough to be able to use union bound over all 
possible choices of A and B. There are (^)^ ways to choose A and B. Denote by Ra the event 
that ^ U R is di-regular for at least one choice of A and B. Thus by union bound, 

2 


T(Ra) < 


n 


an 


p(y > i)i^“’" -hp(y > i)’"/2 


(3.13) 


We now estimate the right hand side using Stirling’s formula. Let 

H{x) = —X log X — {1 — x) log(l — x) 


be the binary entropy function. Then the two terms in the right hand side of (|3.13|) are at most 

2n,[2-H(o) + l 


2n[2H(«) + ^Q:log(P(y>l))] 


and 


iQg(p(y>i)) j 


/an 


'an 


up to universal constants involved in Stirling’s approximation. Our goal would be to upper bound 
the two exponents, 


2H{a) + ^alog(P(y > 1)) and 2H{a) + ^ ^)) 


(3.14) 


Recall that a was defined in (3.3). Consider the three following cases: 
— di 


CASE 1: a < 1 


In this case we will use the bound P(y > 1) < 2 d2a by LemmaPlugging this in (3.14) we get 
the following upper bounds 

2H{a) + ^alog{2d2a) and 2H{a) + d 2 a) ^ 

8 


2 
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Now, 


2H{a) + —alog(2d2a) = —2alog(a) + —alog{2d2a) — 2(1 — a)log(l — a) 

8 8 

< alog(a)(^ - 2) - 2(1 - a) log(l - a) 

< alog(a)(^ -4). 

To see the above inequalities first note that since a < ^, log(2d2a) < as soon as d 2 > 4, and 

also 1(1 — a) log(l — a)| < 4a. Similarly for large enough d 2 we have 


2 ff(a) + 


log(2d2a) 


Thus for large enough d 2 < di 


Hence 


= —2a log(a) + 
log(a) 


log(a) 


— 2(1 — a) log(l — a) 


< 


16 


F{Ro,) < 


23q log(Q)n 


an 


23ck log(cK)n 



< n2"3n 

1 


< 


n 


(3.15) 


where a G Ii = (0,^). The last term is derived using the following: The function a log a is 
“2 

decreasing from 0 to 1/2 and the least possible value of a = Plugging this value of a we get the 
above. 

1 


c_ 

“ ^^2 ‘ 

Now clearly in this range of a, by stochastic domination F{Bin{d 2 ,a) > 1) is maximized when 
a = ^. We now use the Poisson approximation of Bin{d 2 , to bound the probability P(y > 1) 
by a universal constant c which is a function of C for all a in this range. Using this, we rewrite 


CASE 2: A < a < ^. 


(3.13) to get 




2 H(a) + log(c) < —2 q; log(a) + log(c) — 2(1 — a) log(l — a) 

8 8 

< -4alog(Q;) + ^Q;log(c) 

8 

< —5a 


for large enough di. Similarly for large enough d 2 we have 

2 H(a) +'^< 


2 


4 











RECOVERY AND RIGIDITY IN A REGULAR STOCHASTIC BLOCK MODEL 


13 


Plugging in we get 


—5an 



-^ 71 


where I 2 = ^]- Thus the proof for the case when a < ^ is complete. 


< n2 2 


c 


(3.16) 


CASE 3: g < a < i. 


We first need a preliminary lemma. For ^2 £ N and a € (0,1) let 2 ^2,p ~ Bin{d 2 ,p)- 
Lemma 4. There exists a constant Ci such that for all large enough d 2 

sup sup ^{Zd2,a =i) < 
riPl-Pl Z-I l<i<d2 4UU 

^2 ’ 3 ^ 

Proof. It is a standard fact that for any d 2 , ol 

sup ¥{Zd 2 ,a = i) = ^{Zd 2 ,a = [{d 2 + !)«])• 

I<i<d2 

Let k = [{d 2 + 1 ) 0 ;]. We now estimate 

FiZd2,a = k)=(^^^y'^il-af^-'^. 

Since k > Ci hy hypothesis using Stirling’s formula we have 

F{Zd2,a = k) = 0 

“ 400 

for large enough Ci. □ 

We now need another lemma. Consider the exploration process for sampling the bipartite regular 
graph given by A, B (sketched in Definition]^, where vertices of A are exposed one by one to find 
out the neighbors in B. We do this first for each half edge incident to the vertices in A, followed 
by the half edges corresponding to the rest of the vertices in A. Let us parametrize time by the 
number of half edges. Consider the Bernoulli variable 

Bt = l(the half edge is matched to a half edge in B). (3-17) 

Now note that the first d 2 half edges correspond to degsivi), the second d 2 half edges correspond 
to degB{v 2 ), and so on. We now make a simple observation that the Bernoulli probabilities do not 
change much from time t to t + d 2 . This then shows that degB{vi) are essentially Binomial variables 
with probability depending on the filtration at time {id 2 ). Formally, we have the following lemma: 
let Bi be the filtration generated up to time {id 2 ) (when all the half edges up to vertex i have been 
matched). 

Lemma 5. For any i < j there exists a pi which is Wj_i-measurable such that 

\\degB{vi)\Bi-i, Bin{d 2 ,Pi)\\TV = O 

where H-, - Wtv denotes the total variation norm and the constant in the O(-) notation depends only 
on ^ 2 - 
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Proof. To show this first note that the random variables Bt in (3.17) are Bernoulli variables with 
probability 

and 2 - Bj 


Pt = 


nd2 — t 


Then clearly for all t < \pt — Pt-i\ < ^ • The proof thus follows since 

degsivi) = ^ Bj . 

(i—l)d2<j<id2 


Recall i from Lemma Now suppose U R is di—regular. Then by definition 

1^1 


□ 


t\A\ < Y. degsVi < d2\B\ = and2 


2 = 1 


i < 


a 


1 — a 


d 2 < 2 ad 2 . 


Using the above we get that for all j < 


and 2 — j'(£ + 1) ^ and 2 — ^(80^2) ^ a 


(3.18) 


nd2 — jd2 nd2 4 

Above we used the fact that I + 1 < 2ad2 + 1 < 3 ad2 since ad 2 > C > 1 hy hypothesis. Also 
clearly for j < n/4, since a < 1/2, 

and2 — ji 


Assume that all the degB{vi) G + 1}. We have the following corollary. 


(3.19) 


Corollary 1. For all 1 < f < n/4, if degsivj) G {£,!’ + 1}, for some (. < 2d2Ct for all j < i then 
there exists pt which is measurable such that 

\\degBivi),Bin{d2,Pi)\\TV = O (- 

\n 

where f < Pi < 2/3. 

Proof. The proof is immediate from (3.18[), (|3.19) and Lemma 


□ 


We now complete the proof of Theorem in the case « £ ^3 = [^, ^]. Using the same notation 
we used before we have: 


< EE F{degBivi) = gi) 

^ \anJ 400 "-/^ 
ae /3 


< 


an 400^/4 


■)2r2 


< n 


400’^/4 ■ 


(3.20) 
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The first inequality is by the union bound. To see the second inequality observe first that by Lemma 
[^it suffices to assume that g^s G {£,.£+ 1}. Thus the second inequality follows by Corollary and 
Lemma m as soon as 


a > ^ 
4 ^ 


d2 


which we ensure by choosing C > ACi. 


Thus combining (3.15), (3.16) and (3.20) we have shown that 




for some t = T{d 2 ) < 1. Hence we are done. 


□ 


4. Theorem [2] and connection to the min-bisection problem 

Throughout this section we always assume di is even. We first remark that, under the hypothesis 
of Theorem one can make a quick and simple connection to the min-bisection problem. It turns 
out that, in the case of the RSBM, the two problems are equivalent. More precisesly, in the proof 
of Theorem below, we show that the second eigenvalue of Q{n,di,d 2 ) equals di — d 2 with high 
probability, which implies that 7 = 777 ^ where 7 is the spectral gap defined in (3.9). Hence, it 
follows by Theorem]^ that the size of the min bisection of G{n,di,d 2 ) is at least nd 2 - Since the 
true partition (M, B) matches this lower bound, it solves the min-bisection problem. 

We now proceed towards proving Theorem for di even. Recall the notion of random lifts from 
Section 2.3.1 We will now connect G{n, di, ^ 2 ) (RSBM) with random lifts of a certain small graph. 


Consider the following multigraph on two vertices: u and v, with d 2 edges between u and v and 
di/2 self loops at both the vertices (recall that di is even). To randomly n—lift the above graph 


di 

2 


Figure 1. Multigraph lifting to Q{n,di,d 2 )- 



times 


according to Section 2.3 we choose uniformly di -|- d 2 many permutations: 


VTl, 712, 


' Tlrfi, VTl, 7125 




(4.1) 


from Sn- 

Let the lift be ^{n, di, ^ 2 ) on the vertex set {u, u} x {1, 2,... n}. We naturally identify it with 
[2n] = {1, 2, 3, 4,... 2n} with the first n numbers corresponding to u x {1, 2,... n} and the rest 
corresponding to v x { 1 , 2 ,... n}. 

Note that the subgraph induced by f#(n,^ 1 ,^ 2 ) on [n] has edge set {i,7rj{i)) for i G [n] and 
j G [di/2\. Similarly ^ 2 , on [2n]\[n] has edges {n + i,n + for i G [n] and j G [di]\[di/2]. The 

edges between [n] and [2n]\[n] are the edges (i, n-|- 7 r'(i)) for i G [n] and j G [^ 2 ]- Recall Q{n,di,d 2 ) 
from Definition A standard model to generate regular graphs is the well known configuration 


model, as also used in this article (see Section 3.2) Now notice that i^(n, di, ^ 2 ) is essentially the 
same as G{n,di,d 2 ) except the graphs are now generated using permutations in (4.1). This is 
known as the Permutation model (see m and the references therein). We now use a well known 











16 


BRITO, DUMITRIU, GANGULY, HOFFMAN, AND TRAN 


result which says that the two models are contiguous, i.e. any event occurring a.a.s. in one of the 
models occurs a.a.s. in the other one as well (see pdQ 

We now prove Theorem Let the graph in Figure be called ^. The adjacency matrix of ^ is 


: — 


di d2 
d2 di 


with eigenvalues di + d 2 and di — d 2 and corresponding eigenvectors 


1 

1 


and 


J, respectively. Let be the adjacency matrix of l^(n, di, ^ 2 ) which as discussed above 
is a random n—lift of From the discussion in Section [2.3| we have the following: 


• di + d 2 and di — d 2 are eigenvalues of A^^n, with eigenvectors e and a respectively (see 
Definition!^. 

• By Theorem for any e > 0, a.a.s., all the other eigenvalues A of satisfy |A| < 
2\/di + (i2 ~ 1 + £• 


Let An be the adjacency matrix of G{n, ^ 1 ,^ 2 )- That the first fact above holds for An as well is easy 
to check. Moreover, using the contiguity of the two models, An also has the second property a.a.s.. 
Note that finding the partition (M,i3), in Definition is equivalent to finding cr, (the eigenvector 
corresponding to the eigenvalue di — ^ 2 )- Now under the hypothesis of Theorem]^ by the above 
discussion we see that di — d 2 is the second eigenvalue which is also separated from the first and 
rest of the eigenvalues. Thus, we can efficiently compute a unitary eigenvector, re, associated to 
this eigenvalue. To assign the communities, put v G A A and only if > 0. Strong recovery is 
then achieved. This proves Theorem 


5. Complete reconstruction from partial reconstruction: proof of Theorem [3] 

AND Theorem [2] 


In this section we prove Theorem The idea is to show that, because of the rigid nature of 
the graph, if we initialize the partition with a large number of vertices labeled correctly, one can 
bootstrap to deduce the true labels of even more vertices in the next step. We do this by looking at 
the majority of a vertex’ neighbors. Recall Majority from Section [l. 0.1 We prove that with high 


probability the graph Q{n, di, ^ 2 ) is such that if the input (^4, B) has a large overlap with the true 
partition {A,B), then one round of the algorithm reduces the number of wrongly labeled vertices 
by a constant factor. Thus it follows then that, with high probability, after 0(log(n)) iterations, 
no further corrections can be made and the algorithm outputs the true communities. 


Lemma 6. Assume di > ^2 + 4 and let 1/2 < A < 1. Then there exists an e = e{di) > 0 such 
that, with probability 1 — 0(n^/^“^), the graph has the property that if {A, B) (the input) satisfies 
min{|A A\,\B f\B\} > (1 — e)n and if IMfj -^1 =: k and \B[\ A\ =: k', then 

• 

where (Ai,i3i) is the output after one round of Majority. 

The constant in O(-) depends on di, A, e. 


Proof. Let v G Af^Bi (that is, v has the wrong label after one iteration of Majority). We claim 
that V has more than two neighbors in AP\B, otherwise v will have at least di — 2 neighbors in 


^ |13l Theorem 1.3] actually shows contiguity of regular graphs under configuration model and the permutation model. 
Note that Q(n,d\,d 2 ) and ^{n, di, d 2 ) are constructed from three independent regular graphs constructed using the 
configuration model and the permutation model. Since contiguity is preserved under taking product of measures, 
Gin, di, d 2 ) and 5f(n, di, d 2 ) are contiguous. 
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Af^A and hence its label will be the sign of: 

>di-2-{d2 + 2)>0 , 

i'^v 

which contradicts the assumption that v ^ A[\Bi. Thus the occurrence of the event |^P| -Bi| > 
\k implies the occurrence of the event 

Ek := {3 a subset S C |5| = AA: : any v £ S has at least three neighbors in Alf] • 

Hence an upper bound on the probability of the event Ej^ will be an upper bound on the failure 
probability for Majority to reduce the size of the set of incorrectly labeled vertices in Al by a 
fraction 1 — A. 

We compute now an upper bound on the probability of Ef^. By the exploration process (see 
Definition Isj) it follows that for vertices in the set S, the degree sequence {de( 7 (_ 4 p| 5 )(i;)}^g 5 is 
stochastically bounded by a vector of i.i.d. binomial random variables i.e., 


{deg(^_Af^B)iv)}ves ^ {Zv}v&s , where ~ Bin{di, 


k 


n — Xk 


By stochastic domination of vectors we mean the existence of a coupling of the two distributions 

such that the one vector is pointwise at most the other vector. As ¥{Z^ > 3) < (^)^ by union 

bound and counting the number of choices for all the possible sets A{^B ol size k and S of size 
Xk, we obtain the following: 


nEk) < 

Adding over all possible fc, we obtain 


n 


n 


k/ \Xk/ \n — Xk 


dik \3Afc 




> AA; I k < en 


< 


E 

k=l 


n 


dik \ 3 AA: 


Xk)\n — Xk 


< 


(5.1) 

(5.2) 


dl^e^+^ xfc/feN(2A-i)fc 
^ VA^(1-A)3^/ \n) 

/ \m 

The last inequality follows by using the bound (^) < ( ^ ) , as well as the fact that n—Xn < n—Xk. 

^3Agl+A 

Denote now by c = c{di) := 

We show now that the sum in (5.1) is We split this sum into two parts, Pi and P 2 , 

the first representing the sum of all the terms corresponding to indices up to [\/nJ, and the second 
part representing the rest. For Pi, we obtain that 


= E 

k=l 

00 

E 




C I - 
n 


) 


k=l 


< 


< 


k=l 
2c 

n^-V2 


A-1/2 


n 
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The last inequality is true for large n. To bound P 2 , we note that k/n < e and we write: 

.nJ 


A= E 

k=l^/n\ 


< E 

k=l\/^ 


) < 


1 — ce 


2A 




2A-l\rVn] 


The last inequality above follows by choosing e so that < 1. Hence the probability of event 

Ek is O . As the problem is symmetric in A and B, it follows that a similar bound can be 

found for the event that . Thus by union bound, the probability of both events is 

also O and the proof of the lemma is complete. □ 


5.1. Proof of Theorem Let e = e((ii) as in Lemma Initialize Majority as {Aq,Bq) = 
{A,B) where A,B satisfy the conditions of Lemma Denote by {Ai,Bi) the partition after the 
j^th iteration of Majority where Ai corresponds to the vertices labeled +1, i.e., [Ai,Bi) is the 
output of the algorithm when we initialize it with (Aj_i,Hj_i). Consider the random variables 
Xi = max{|A.nI'® n^*|}- Note that {Xi = 0} iff A = Aj (and thus B = Bi). Also by the 
hypothesis Xq < ere, so Lemma implies that 

P(Ai < VA; , V 1 < i) > 1 - 


Let now t = 


logjen) 1 
log A 


Since the AjS are integer-valued random variables, we have 


F{Xt = 0) > 1 - 0(re^/2-A^ ^ 


which proves the theorem. □ 

Proof of Theorem The proof is a straightforward corollary of Lemma and Theorem □ 
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6. Appendix: Proof of Lemma [T] 


IS 


Recall that we have used spectral properties of random lifts to prove Theorem when di 
even, see section]^ The general proof relies on studying the matrix of self-avoiding walks (formally 
defined below) of the graph Q{n,di,d 2 ). This is the same matrix used in j2T] to prove the blockmodel 
threshold conjecture. This section adapts the techniques in that paper to the regular setting to 
prove Lemma 

In the case of the RSBM, the lack of edge independence increases the complexity of many of the 
calculations. On the other hand, the rigid nature of the model forces certain other calculations to 
be much easier for e.g. the size of small neighborhoods. 

The key ingredient in the proof of Lemma will be Proposition Its proof hinges on two 
technical lemmas we present below. We give here the proof of Proposition!^ subject to these two 


lemmas, whose proofs we defer to Sections 6.1 and 6.2 


Let us recall the definition of a self-avoiding walk on a graph G. Given two vertices i and j and 
a length / > 0, a self-avoiding walk from i to j of length I is a graph path {i = vo,vi, ...,vi = j) 
such that |{uo, vi ,..., u;_i}| = 1. 
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We denote by the matrix whose entry equals the number of self-avoiding walks of length 


I between i and j, for all 1 < j < 2n. 


Definition 5. We say that the sequence of unitary vectors {vn}n>i is asymptotically aligned with 
the sequence of unitary vectors {wn}n>i ib 

lim I < Vn,Wn >1 = 1- (6-1) 

n^oo 

Definition[^means that, asymptotically, Vn and Wn are the same up to a factor of —1. Throughout 
the rest of the article let 

> A2 > A3 > . . . > \ 2 n (6-2) 

be the eigenvalues of , 


Proposition 2. Assume {di — ^2)^ > 4 (di + d2 — 1). Let I = clog(n) where c is a constant such 
that clog((ii -1-^2) < For any fixed e > 0 the following three events happen with high probability 
as n grows: 

(a) Ai = {di + d2){di + d2 — 1)*“^ -|- o(l) and any unitary eigenvector associated to Ai is 
asymptotically aligned with e (the vector of all ones). 

(b) There exists a constant A > 0 such that A2 = Aa\l + o(l)), where 

di — d 2 + \/ {di — ^2)^ — 4 (di + d 2 — 1 ) 

« - 2 ’ 

any unitary eigenvector associated to A2 is asymptotically aligned with a (the vector of 
labels). 

(c) |Aa;| < IA3I < n‘^{di + d2)^/^(l -I- o(l)), for all 3 < /c < 2 n. 

Remark 6.1. Note that as {di — (^2)^ > 4 (di + d 2 — 1 ), 

di — d2 + sj{di — ^2)^ ~ 4((ii d2 — 1) > di — d2 + 1 > 2\/d^^-\-li2^^ + 1 > 2-^cii -|- <^2 > 
the latter inequality being true as di -|- ^2 > 6. This, in turn, means that 

o = + -^2 - 1 ) ^ ^ 

and as I = O(logn), by picking 0 < e < 1 — 4 clog(di -1-^2), ( | 6 . 3 [ ) is enough to show that 
lim I A3 1 /I A2 1 = 0, so the first two eigenvalues of are separated from the bulk. Also note 

n^oo 

that a < di — ^2 < di + ^2 — 1, so Ai and A2 are also separated from each other. 

From part (6) of Proposition one can see how to construct a labeling that recovers at least 
(1 — e)n vertices correctly, for any e > 0 and n = n(e) large enough using an eigenvector associated 
to the second eigenvalue. 

In order to understand the nature of the neighborhoods of G{n, di, d2), we will need the fact that 
cycles are few and far from each other, with high probability. To this end, we introduce the notion 
of tangle-freeness^ which by now is a standard concept in the random graph literature. For e.g. see 
uni and |20]. 

For a vertex v in G{n, di, d2), and for t G N let the ball of size t centered at v be denoted by 

Bt{v) = {u G G{n, di, d2) : d{u, v) < t} , 

where d{u,v) is the graph distance between vertices u and v. We define the boundary of Bt{v) by 

dBt{v) = {n G Q(n,di,d2) : d{u,v) = t} . 
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Definition 6. A graph G is said to be l-tangle-free if for any vertex n in G the ball Bi{v) contains 
at most one cycle. 


As we will see, with probability going to 1 the random graph Q{n,di,d 2 ) is Z-tangle-free for 
I = clogn for a small enough constant c (the c in the statement of Proposition works). 

The next two lemmas contains estimates for the leading two eigenvalues and the corresponding 
eigenvectors which imply spectral separation. The proofs we defer to the next sections. 

Lemma 7. Let the matrix of self-avoiding walks of length I = clog(n). Recall e and a from 
Definition . Assume G is l-tangle-free. With high probability, the following two events happen: 

(i) S^^'^e = {di + d 2 ){di + d 2 — l)^“^e + e for a vector e such that ||e ||2 = o(n), 

(ii) there exists a constant A = A{di, ^ 2 ) £ 1^ such that = Aa’‘{l + o(l))cr -|- d for a vector 

Vj-Uj-II-II f \ jr di-d2+y/{di-d2)^-4:{di+d2-l) 

a such that ||(T ||2 = o[n), and tor a =-^-■ 


Lemma 8. Let 1 < m < Z, and all the notations as above. For any unitary vector x such that 
x'e = x'a = 0 the following holds with high probability: 

< (Z + l)n^(di+d2)'"/'(l + o(l)) ; 

here b = clog((ii + CZ 2 ) <1/4 and e is as in Proposition]^ 


Using these two lemmas, we can prove the main result of this section. 

Proof of Proposition^ Recall (6.2). Let {tc,} be an orthonormal basis of eigenvectors, with Wi 
associated with Aj for all 1 < i < 2n. From Lemma and Lemmawe have, with high probability. 


sup \x'S^^'^x\ = {di + d 2 ){di + d 2 — if ^ + o(l) , 

|x|=l 


which implies that, with high probability, Ai = (cZi + d 2 ){d\ + d 2 — if ^ + o(l). 

As (T T e, it follows that 

sup \x'S^^'^x\ > \a'S^^'^a\ = Aafl + o{l)) ; 

\x\=l,xJ-e 

on the other hand, Lemma and the Courant-Fischer theorem (see [U]) guarantee that 

sup \x'S^^'^x\ = odu'S^-^Vl) . 

|a:|=l,ir_Le,cr 

This yields that, with high probability, A 2 = Aq;*( 1 + +o(l)). Finally, Lemma also yields that 
|Afc| < (Z -|- l)n'^(cZi -|- ^ 2)^(1 + 0 ( 1 ))) for all k > 3. 

We address now the issue of eigenvector alignment. Recalling the definition of alignment (6.1), 
let e = (\/^)“^e. We can write 

2n 

'^CiWi , 


e = 


2=1 


with ^ ^ norm). Our goal is to prove that ci —)• 1. Note that w = Ylt =2 

is perpendicular to wi. Therefore we can write, as per Lemma 

((Zi + d2)(tZi+cZ2-l)'“^+o(l) = ||5^'^e||2 < ||5(')cin;i||2 + ||5(')u}||2 

< ci(cZi -|- d2){di d2 — 1)^ ^ + 0(1) + IA2I , 


2n 


Wi 


where the last inequality is due to the Courant-Fischer theorem. As A 2 /A 1 —)• 0 with high probability 
as n —)> 00 , it follows that, again with high probability, ci —)> 1 and e and wi are asymptotically 
aligned. 
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Similarly, we show that a = and W2 are asymptotically aligned; as a and e are 

orthogonal and we just proved that e and rci are asymptotically aligned, if we write a = Yli=i > 

2 n 

it follows that lim ai = 0 . Let w* = Y] aiWi. 

72—^OO ^0 

Note that 

oiAi = Ai(ri;i, d) = d) 

= (tci, S'^^^d) = Aa\l + o(l))(t(;i, d) + o(l) 

= aiAQ;*(l + o(l)) + o(l) . 

The above implies that ai(Ai — Aa\l + o(l))) —)■ 0 , and since Ai ;:?> Aa\ it follows that we have 
the much stronger statement aiAi —?■ 0 . 

Now we use Lemma [3 to write: 

Aa\l + o(l)) = ||S'*^^^d||2 = ||5'‘^^^aitci||2 + | |5'*^*^a2tC2| I2 + ||*S'*^^^t(;*||2 
< uiAi + 0(1) + a2^Q:^(l + 0(1)) + IA3I , 

since IA3I <C Aa^ by Lemmaj^ and we just showed that aiAi —)■ 0 as n —)• 00, it follows that 02 —)• 1 
as n —7- 00, and with high probability the vectors a and W2 are aligned. This completes the proof 
of Proposition □ 


As mentioned, the rest of this section is dedicated to proving Lemmas and In Section 6.1 


we give a clear description of the neighborhood structure of G{n,di,d 2 ) which leads to the proof 
of Lemma and in Section |6.2| we use this nei ghi jorhood structure to obtain spectral bounds for 
SW via the moment method, and prove Lemma 


6.1. Proof of Lemma We start by analyzing the local neighborhoods in G{n,di,d 2 ) The next 
lemma establishes two important properties of G{n,di,d 2 )- with high probability, the graph is 
l-tangle-free for I as in Proposition!^ and the number of cycles of length less than I is small. 

Lemma 9. Let G ~ Q{n,di,d 2 ) and I = clogn such that 5 := clog(di + ^ 2 ) < and let 
0<e<l — 4Jbea small constant. Then 


(a) G is l-tangle-free with probability 1 — 0{n ^). 

(b) Denote by g y{G) : Bi{v) contains a cycle}. Assuming that G is l-tangle-free, 

> n^) < 0{n-^) . 


Proof. The first part of the lemma already appears as [2nt Lemma 2.1]. To prove the second part 
we use the following standard variant of the exploration process mentioned in Definition (This 
variant is also used in the proof of [20] Lemma 2.1]) . Choose a vertex u of G and fix some ordering 
among all other vertices. Consider the process that (in accordance to the ordering) exposes the 
neighbors of v , then reveals the neighbors of the “exposed” vertices, etc., until we have explored 
Bij^i{v). Note that we always expose all neighbors of dBs{v) before any neighbor of a vertex in 
dBs + i { v ). 

Consider the events Tr{v) = {Br{v) is a tree} for 0 < r < /. Since the events Tr{v) are nested 
and P(ro(u)) = 1, we conclude that 

i-i 

• (6.4) 

r=0 

As we construct Tr+i{v), at each step, half-edge choices for the next match that do not create 
cycles are all those belonging to vertices not yet considered. There are fewer than {di + ^ 2 )^^^ 
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vertices that have been considered so far, for a total of less than (di +^ 2 )^"*"^ possible bad matches. 
Hence, for Tr+i{v) to hold, we have for each of the (di + d 2 ){di + d 2 — 1)’’“^ < (di + ^ 2 )^, vertices 
at the level, at least n{di + ^ 2 ) — (di + ^ 2 )^"''^ choices for the half edges, out of the maximum 
possible n(di + ^ 2 )- (In fact we have fewer than n{di + ^ 2 ) possible choices remaining; however, 
since r < I = I = clogn, for c small, we still have n(di + d 2 )(l — o(l)) possibilities, at every step.) 
This means that 


nTr+i{v)\Tr(y)) > 


n{di + ^ 2 ) - {di + 

n(di+d2) / 


{di+d2Y 


By (6.4), 


FUiW)> 




Taking logarithms, we obtain 

r=l—l 


log(P(T,M)) > 'l:V.+<i2)Mog(l-<h±*)T) = _(l + „(l))'g‘(!il±^ 

n n 

r=0 r=0 


^2r+2 


— —(1 + 0 ( 1 ))^-^-— —(1 + o(l))(di + d 2 )n 


r=0 
25-1 


This implies that P(T;(i;)) > 1 — 0{rr° for n large enough. Hence 


n 
25-1^ 


= ^E(r,(u)'^) < 0(n-2^). 
The results follows using Markov’s Inequality. 


□ 


We are now ready to prove Lemma 

Proof of Lemma^ Let T = {u G V{G) : Bi{v) is a tree}. Observe that if Sul > 0 then v G Bi{u). 
Furthermore, if tt G 7~ then 


If u G r. 


c(0 ^ J 1 if ^ ^ dBi{u)- 

[0 else. 

(5«e), = \dBi{v)\ = (di + d 2 ){di + d 2 - ly-^ . 


(6.5) 

( 6 . 6 ) 


Write 

5^6 = (di + d 2 )(di + d 2 - l)^-^e + e , 

where e is an error vector and note that, from ( |6.6[ ), Ct, = 0 if i G T. Note that for all u and v we 
have < 2, otherwise we have more than one cycle in Bi{u), which contradicts the assumption 
that G is ^-tangle-free. Using that \Bi{u)\ < (di + ^ 2 )^ we have for v ^ T: 

e^, < 2(di + ^ 2 )^ 

Lemma[^(6) implies that: \P^\ < with high probability. Finally, by our choice of 6 in Proposition 
[ 2 ] we conclude: 

||e ||2 = o(n) 


This proves part (i). 
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The calculation for part (ii) is slightly more complex. For a fixed vertex v and every 0 < k < I 
let 

Xk{v) ■= : d{v,w) =k,a^= o-^}| , yk{v) := |{t(; : d{v,w) = k,aw = 

and let 

Zk{v) := Xk{v) - Vkiv). 

Thus, Xk{v) counts the number of vertices in the boundary of Bk{v) with the same label as v and 
similarly, yfc(u) counts the vertices in the boundary of Bk[v) with label —a{v). The importance 
of these quantities is reflected in the following observation: if v,v' £ T then Xk{v) = Xk{v') and 
yk{v) = ykW) for all 0 < fc < f, so Zk{v) = Zk{v'). Also, for any vertex v, 

(5^0-)^ = = {xi{v) - yiiv))ay = zi{v)ay . 

W 

Since with high probability all but a negligible number of vertices are in T and hence have the same 
zi{v), this relation suggests that a is almost an eigenvector. We make this understanding rigorous 
in the claim below. 

Claim 1. With the notation introduced before the following holds with high probability: 

a) Sdla = zia + a where zi = zi{v) for some (any) v £T, and ||5'||2 = o(n). 

b) Assume that the equation — (di — d 2 )x + (di + ^2 — 1) = 0 has two distinct real roots 
(which is equivalent to the condition {di — ^ 2 )^ > 4(di + ^2 — 1)) and denote the biggest root 
by a (trivially, a > 0). Then there is a real constant A > 0 such that: zi = Aq;^( 1 + o(l)) 
as n —)• 00 . 

Proof. To prove part a) of the claim, let a = — zia. We have = 0 if v £ T', else 

dv < |(5'('V)t,| + zi{v) < 2\Bi{v)\ < {di + ^ 2 )^ • 

By Lemma|^ \'T‘^\ < n^, with high probability, where <5 < j. Note that = {di + d 2 )^ and since 
(di + d2)' = 0 ( 74 ^/*^), we can conclude that, with high probability, 

||ct ||2 < n'^(di + d 2 )* = o{n) . 

To prove part b), we actually compute zi. We do this by finding a recurrence for Xk and yk, 
which leads to a recurrence for Zk, which we can solve. 

Consider a (di + d 2 )-regular rooted tree and the following labeling process on it: the root is 
labeled as +1. Among its neighbors, choose di vertices uniformly and label them +1, and label the 
others —1. Continue the labeling process in such a way that for each vertex w in the tree, exactly di 
neighbors have the same label as v. Denote by Xk (respectively, yk) the number of vertices labeled 
+1 (respectively, —1) at distance k from the root. We have: 

xi = di, yi = d2, X2 = dl + dl - di - d2, 7/2 = 2did2 
Fix k > 3; we have that 

Xk = diXk-i + d2yk-i - (dl + d2 - l)xk-2 ■ 

To see this consider edges going ‘out’ of the {k — l)th level whose other endpoint is a +1. Clearly 
number of such edges is 

diXfc_i + d2yfc_i. 

Now to compute the number of +l’s at the fcth level one needs to subtract the number of edges 
going from A: — 1 to a +1 vertex in level k — 2 since all the vertices at level k have exactly one edge 
connecting to level k — 1. Now number of edges between k — 2 and k — 1 where the vertex at level 
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A: — 2 is a +1 is Xk- 2 {di + d 2 — 1) since each vertex at level k — 2 have exactly + ^2 — 1 edges 
going down. 

By symmetry, using the same counting argument as above, we can obtain that 

Vk = diVk-i + d 2 Xk-i - {di +d 2 - l)yk -2 ■ 

Subtracting the two recurrences we obtain the recurrence for Zk- 

Zk = {di - d2)zk-i - {di + d2- l)zk-2 • 

Hence, if a, /3 are the roots of — {di — d 2 )x + {di + ^2 — 1) = 0, which we assume to be real 
and distinct, there are constants. A, B such that: 

Zk = Aa^ + ■ 


A and B can be computed using zi and Z 2 , which are positive, and since Zk eventually will go to 
oo, the fact that H > 0 follows. □ 

This finishes the proof of Lemma □ 

6.2. Proof of Lemma As was observed in [^, the spectrum of can be studied by relating 
it to the spectra of for 0 < r < 1. In fact. Theorem 2.2 of [21j is valid here as well; we will 

not present the proof, as it is applies verbatim, but we will introduce the notation and explain the 

quantities involved. 

Consider the matrix: 

Let be the matrix whose entries are given by 

t=i 

where the sum is taken over all self-avoiding walks from i to j of length 1. Finally, consider the 
matrix for 1 < m < /, whose entries are given by 

l—m I 

n ^**-i*‘ (6.8) 

t=l t=l—m-\-2 


Here we sum over paths of length I obtained by concatenation of two self-avoiding walks of lengths 
I — m and m — 1 respectively, the first starting at i and the second ending at j, with the additional 
constrain that they have non-empty intersection. 

Theorem 2.2 in [2T] gives the following equation 

i i 

Sii) = ^(i) + ^ ^ ; (6.9) 

m=l m=l 


In the decomposition above it turns out that the first and the third terms have small spectral norm 
and hence understanding the spectrum of Sd'> becomes equivalent to understanding the spectrum 
of the middle term. 

Throughout this section, unless otherwise noted, expectations are taken with respect to the 
randomness in the graph, given a set of labels a. Later, the dependence on a is removed with the 


help of Lemma 13 


To upper bound the moments of the trace of powers of and we will need a few lemmas. 

Recall Definition For any set E C [2n] x [2n] let Xe be the indicator of the event 
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that is, E is a, subset of the set of edges of the random graph G. Similarly we denote by Xe<^ the 
indicator of the event 

'■= '^{EnE(G)=0} 1 

when no edge in E is an edge of G. When E has one element we will use e instead of E. 

Lemma 10. Let G{n, d) be a random d-regular graph uniformly chosen. Let Ei,E 2 be two disjoint 
sets of edges with \Ei\ < 0(log(n)), i = 1, 2 and let e be another edge not in Ei or £' 2 .The following 
holds: 

Furthermore, if the edge sets {e} and {Ei U £ 2 } are disconnected, then 

The same hold for a random d-regular bipartite graph. 

Remark 6.2. There are sets £ for which Xe is identically zero, for example, if G is a random 
d-regular graphs, any set £ that results in a vertex having degree greater than d. Conditioning on 
the event {XeiXe^ = 1} implies that this is not the case. 

Proof. We only show the proof for regular graphs since for bipartite graphs the proof is analogous. 
From |24l Theorem 3] we have, for any set of edges £ of size at most 0(log(n)) and any edge e ^ £: 


¥{Xe = 1 \Xe,Xe^^ = 1) = - + 0 
^ n 


\Ei\ + 




F{Xe = 1 \Xe,Xe^^ = 1)<- + o( 

n \ 


¥{XeXE = 1) < n^E = 1) ( 1 + O 

n 


M 

n 


Now by inclusion-exclusion : 

n^E, = 1) - n^E.x^ = 1) < n^E.XE^, = i). 

eeE2 

The above can be used to obtain 

F{XeXE, = 1) 


( 6 . 10 ) 

(6.11) 


F(XeXE, Xe- = 1) 

F{X, = 1\Xe,Xe^ = 1) = ^ ^ -tT < 


F{Xe,Xe^^ = 1) - F{Xe, = 1) - EesE, mE,Xe = 1) • 


Now use (6.10) to bound the numerator from above and the denominator from below on the right 

'i^ii + ii^2r 


hand side to get 


P(Tle = 1\Xe,Xe^^ = 1) < =- + 0 

F{Xe, = l){l - OC-^)) n 




as desired. 

To prove the second claim, note that if the edge e is disconnected from the edge set £ then 
Theorem 3] yields equality: 


d 


F{XeXE = 1) = -F{Xe = 1) 1 + O ^ 


n 


I£| 


n 


( 6 . 12 ) 


also for disjoint £1 and £2 and n large this means 


F{Xe,Xe2 = 1 ) < If{Xe, = 1 ) . 


(6.13) 
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Now examine the following: 

= 1\Xe,Xe^, = 1) = 


nXeXE.XE^, = 1 ) _ n^e^E, = 1 ) " ^{X^Xe^^e^ = 1 ) 
nxE,XE^ = i) 


nxE, = i)-nxE,XE, = i) 

from ( 6 . 12 ) we see that the numerator on the right hand side is equal to: 

'l^il 


(6.14) 


^{n^E, = 1 ) - n^E.XE^ = 1 ))+ n^E, = w 




-^Xe^Xe, = 1)0 


|^l| + |^2| 




Plugging back into (6.14) and simplifying, we get: 


^ ¥{Xe, = 1)0 (^) - ¥{Xe,Xe, = 1)0 

^{X, = 1\Xe,Xe^ = !) = - + - ^ ^ ^ ^ 


n 

n 


nxE, = i)-nxE,XE, = i) 
\Ei\ + \E2[ 




where the second equality is by (6.13). This concludes the proof. 

The following is a very simple lemma whose proof we provide for completion. 

Lemma 11. Let X ~ Ber{q) with q < p + r, where 0 < < 1. For any integer m > 1, 

|E((X-p)"^)| <p + r . 


□ 


Proof. Assume q < p. Then: 

E(| A - pD < (1 - p)> + < p ; 

the latter inequality follows easily by noting that it is satisfied for m = 2, and that (1 — p)”*p + p™' 
is a decreasing function of m for all 0 < p < 1. 

If g > p, write q = p + r' with 0 < r' < r. We get: 

E(| A - pD < (1 - prip + r') + p- < (1 - p)-p + + / < p + r , 


due to similar considerations. 


□ 


A particularly important point, related to using Lemma [Tol will be to examine the possible 
number of disjoint edges in an ordering of a given set of edges. 

Given a set of ordered edges E = {ej}, we say edge Cj is disconnected if the sets {ej} and 
{ei, 62 ,..., ej_i} are disconnected. We denote by d{E) the number of disconnected edges of E. 
Clearly this number depends on the order of the elements of E. 


Lemma 12. Let G be a graph with maximal degree equal to d and E a subset of edges of G. Then 
there is an order of the elements of E such that: 


6{E) > 


2d 


Proof. We denote by [|F1|] = {1,2,..., |F1|}. 

We claim that the following algorithm finds a bijection tt : [|F1|] —)• E with the required property: 
Choose an edge eoi E and consider the subset E{e) of all the edges of E that are adjacent to e. Since 
i? is a subset of the edges of a graph with maximal degree d we have |F^(e)| < 2d. We will add e and 
the edges in E{e) at the end of the ordering; namely, we let 7r(i) G E{e) for \E\E{e)\ + 1 < i < |i?| 
and Tr{\E\E{e)\) = e. We have used at most 2d edges. We now exclude all those edges from our 
set E and continue constructing the bijection by recursion, until no more edges exist. Note that 
if we add the edges in the order given by tt the construction ensures that e is disconnected. Since 
each time we exclude at most 2d edges the results follows. □ 
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Proposition 3. Let -E be a set of edges of G{n,di,d 2 ) with \E\ = K = 0(log(n)). Let rrii, 
i = 1,... ,\E\ be positive integers. The following holds: 


n=K 




\(y 


. i=l 


< 


/i=K , 

n —1 ( 1+0 


. i=l 


n 




n 


i / ’ 

\n J 


where d^^ = di if e* has both endpoints in the same clusters and de^ = d 2 if not, and 


CJ = 1 + 


2d 


if 5{m,=i} > 0 and a; = 0 else. 
Proof. For 1 < s < iL write: 


Xs = \{{A,,-Ae 


2=1 


Also denote by Qs the u-algebra generated by {Ae^, Agj,..., Ae^}. As a first step, we will show that 


i=K 


|E(X^|a)| <Yll^ + 0 
\ 


2=1 


dp- 


log(n) 




n / 


Thus, 


|E(Ai^|u)| = \&{E{XK\a)\GK-i)\ = |E((Xi^_i|a)E((Ae^ - Ae^r^\a)\QK-i)\ (6.15) 

The last equality follows by observing that A^-i is ^x-i-nreasurable. If mx > 1, noting that 
Xpj^ = Ae^, Lemma 10 implies: 


E{Aej^\a,QK-i) < 


n \ J 


If we now apply Lemma 


II 


with q = E{Aej^\a,gK-i), p = and r = 0 we obtain 


miAp, - Ap,r^\a)\gK-i)\ < -^ + 0 


log(n) 




Substitute in (6.15) to obtain 

\E{XkW)\ < \E{Xk-i\c7)\ 


-^ + 0 
n 


log(n) 




On the other hand, if rriK = 1 and ex is disconnected then using the second part of Lemma 10 
one can see that 

|IE(Aej^ — Ae^)|cr| < ^ . 




To complete the proof, we reorder, if necessary, the edges of E such that we have the maximum 
possible number of disconnected edges with the property that the corresponding exponent ruj = 1. 
By Lemma 12 this is equal to a;, as defined in the proposition. 

We conclude that. 


i=K 

|E(A;^|a)| < n 

2 = 1 

The proof is complete. 


dpi 


n 


+ 0 


log(n) 
2 




/'i=K 

n 

V 2=1 


dp. 


n 


1 + 0 


log(n) 

n 


□ 


The next lemma considers the expectation under the measure generated by the labels. 
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Lemma 13. Let (T, o) a subtree of Q{n,di,d 2 ) with at most 0(log(n)) many edges. Then: 



where indicates we are taking the expectation over the measure generated by the labels. 


Proof. Let w he a leaf of T. Let be the a-algebra generated by the labels of all vertices in T 
but aw We have: 




d, 


dp. 


eST eST eeT,e^e 

Now we check that Pa{de = di) < 1/2 + O ^^. Given any event on this is, any labeling of 

the vertices of T expect for w, denote by s"*" the number of positive labels and by s~ the number 
of negative labels. Recalling that s = = 0(log(n)), we have: 

PM - J 1 - _ n - s+ ^ 1 /log(n) 

- <il) - -pSZTT- - <5+0 

An analogous bound holds for P(dg = d 2 ). We conclude that 

Tog(n) 


n 




n 


2n 


.of 




Repeating this argument we get: 


E„ 


n 

\e&T 


d. 


- < 

n / 


di + d2 
2n 


+ 0 




for n large. This completes the proof. 


□ 


We now have enough tools to examine the spectral radius of which we denote by /^(Ad)). 
For any integer k we have p(Ad))2^ < Tr((Ad))2*^) and hence the same inequality holds if one takes 
expectation. From the definition of A^) we have: 

E(rr((A<'l)“)) = E(n(.4., - A..)”') (6.16) 

ceC 

where C is the collection of cycles in Q{n, di, d 2 ) of length 2kl obtained from the concatenation of 
2k self-avoiding walks of length 1. Also, m, is the number of times the edge is traversed in one 
of such cycle. To bound the expectation from above we need to bound the number of such cycles; 
this was done in [2T], but we include the argument here for the sake of completeness. 


The idea is to bound the number of cycles in C with v vertices and e edges. Given one of these 
cycles, number the vertices by the order they appear in the cycle, starting at 1, and denote by T 
the tree of those edges of the cycle which lead to new vertices. 

R is crucial to note that the listing of a cycle is in order and thus it tells us how it was traversed, 
so the above enumeration and T are well defined. 

Recall that each cycle is the concatenation of 2k self-avoiding walks of length 1. We will break 
each path into three types of sub-paths and then we encode these sub-paths. To do so, we start 
traversing the cycle, and we check each time we found one of the sub-paths described above. Given 
our position on the cycle and the tree of the previously discovered vertices we represent each type 
as follows: 
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Typel These are paths with the property that all their edges are edges of T and have been traversed 
already in the cycle. They can be encoded by its end vertex. This is because our sub-path 
is part of a self-avoiding walk, and it is a path contained in a tree. Given its initial and its 
hnal vertex there will be exactly one such path. We use 0 if the path is empty. 

Type 2 These are the paths with the property that all their edges are edges of T but they are 
traversed for the first time in the cycle. We can encode these paths by its length, since they 
are traversing new edges and we know in what order the vertices are discovered. We use 0 
if the path is empty. 

Type3 This is just an edge that connects the end of a path of type 1 or 2 to a vertex that has been 
already discovered. Given our position on the cycle, it is clear we can encode an edge by 
its final vertex. Again, we use 0 if the path is empty. 

Now we decompose each self-avoiding walk into sequences characterizing its sub-paths: 

{Vi,qi,ri){p2,q2,r2){...){pt,qurt) 


Here, pi characterizes sub-paths of type 1, qt characterizes subpaths of type 2 and r* characterizes 
sub-paths of type 3. 

Note that pi and r* are both numbers in {0,1,..., u}, since our cycle has v vertices. On the other 
hand, qi G { 0 , 1 ,..., Z} since it represents the length of a sub-path of a self-avoiding walk of length 
1. Hence, there are {v + 1)^(/ -|- 1) different triples. 

We must now see in how many ways we can concatenate sub-paths encoded by the triples to form 
a cycle. First, note that r* = 0 only if {pi,qi,ri) is at the end of a self-avoiding walk. Hence, all 
other triples indicate the traversal of an edge not in T. There are e — v + 1 such edges and each of 
it can be traversed at most 2k times in the cycle. Hence there are at most {{v + 1)^(Z -|- l)) 2 fc(e-t;+i) 
triples with > 0 and there are at most {{v -|- 1)^{1 -|- 1 ))^^ triples with = 0 . 

We conclude there are at most 




(6.17) 


cycles with v vertices and e edges. 

Recall that we want to bound the right hand side of (6.16). Denote by v{c) the number of 
vertices visited by the cycle c. Let us split C into three subsets Cj, j = 1, 2, 3 as follows: 


. Cl := {c G C 
. C 2 := {c G C 
. C 3 := {c G C 
Clearly C = [jCj. 
For j = 1, 2, 3, let 


all edges in c are traversed at least twice.} 

at least one edge in c is traversed exactly once and v{c) < kl + 1.} 
at least one edge in c is traversed exactly once and v{c) > kl + 1.} 


cGCt 


IE 


Ra 


e, - AT- 


From (6.16) we then can write: 


E(rr(AW)2^) </i+l2 + /3- 

We will bound each Ij separately. For Ii we have by Proposition]^ 

G(c) 


(6.18) 


ceCi 


, i=l 


n 


log(n)" 


n 


where e(c) denote the number of different edges traversed by the cycle c. Note that since all edges 
in cycles of Ci are traversed at least twice we have w = 0. The same condition implies that each of 
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these cycles have at most kl different edges, since its total length is 2 kl, and at most kl + 1 vertices, 

log(n)^' 


since each c is connected. Use (6.17) to get: 

kl 




e-.i+l) I 77 E 

n 


n 

V 2 =l 


1 + 0 


n 


v=l+l e=v—l 

Note that the right hand side depends on the label of the graph. We will average under the 
randomness induced by the label in the following way: for each cycle c, recall that T is the tree of 
spanned vertices, this is the tree of those edges which discover new vertices when traversed. For 
any edge e G c not in T use the bound < (di V ^ 2 )- Now take expectation with respect to the 
labels over T. From Lemma [13] we conclude: 
kl+l kl 


h < 


E E 2(2„r[{„+1)^(1 


n 


v=l+l e=v—l 

< m{l + o{l))[{kl + 2f{l + l)f'^n{di+d 2 )^^. (6.19) 

To explain the second inequality, note that since kl = O(logn) the only terms that are asymptoti¬ 
cally significant are the ones for which e — u + 1 = 0. We now bound from above by kl times the 
highest term. 

Using the same kind of reasoning and Propositionwe obtain the following bound for I 2 '. 


'2 s E E 2(2n)^[(u + l)2(Z + l)]2^(i+^ 

D=i+1 e=v 


2 n 


n 


< 4A:/(1 + o(l))[(A:/ + 2)‘^(l + l)]^^((ii V d 2 ){di + (^ 2 ) 


xkl 


( 6 . 20 ) 


Note that now e > v since each c is a closed path and at least one edge is traversed exactly once. 


and we have used the trivial bound ( 


0 j{c) 


< 1 . 


To bound I 3 , note that for each c £ C 3 , from Proposition!^ 


e(c) 

i=l 


< 


+(c) 

n— I D+o 


, i=l 


n 


log(n) 


n 


e{c) 


n 


u{c) 


( 6 . 21 ) 


The notation uj{c) above is to indicate that the value of uj from Propositiondepends on the cycle 
c and the order of the edges {cj}. 

Note that the right hand side of (6.21) is decreasing in u. Our strategy will be to show that if 
v(c) is large then w(c) is also large and thus the right hand side in ( | 6 . 21 ) is small. 

More precisely, let c G C 3 be a cycle with v(c) = kl +1 and denote by e(c) the number of edges 
that are traversed exactly one in c. We have e(c) > v(c). Since e(c) — e(c) edges are traversed at 
least two times and the length of c is 2 kl we have: 

e(c) + 2 (e(c) — e(c)) < 2 kl 
which implies e(c) > 2t. By Lemma [T^ we get: 

/ \ t 
u{c) > 


di + <^2 


( 6 . 22 ) 


Combining (6.17), (6.21) and (6.22) we get: 


2,kl ‘2kl / j j \ \)—1 / 

E E2(2"n(»+i)4i + i)i"‘’''+'-+‘M^M^) ( 

— L'/_L0 P='7I ^ ^ 


h < 


{di V d2) 


v=kl+2 e=v 


n 


e—ti+l 


v — kl 

€ \ di +^2 

n 
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Rewrite the right hand side above as: 

f; f; 4(rfi V dMv +i)^(i +1)]-* ) {d, + d,y 


v=kl+2 e=v 

We have: 


l 2 fc 


v — kl 

,_1 f ^ \ di+d2 

n 


(di V d 2 )[{v + 1)^(^ + 1)] 


2k 


n 


< 1 


kl-l 


for n large. Note that the numerator is bounded by some polynomial in log(n). 

Also note that 

e-kl / 1 \ V — kl 

(dl + d2)"-' (^) < (dl + d 2 f-^ (^(dl + d2) (^) j < (di + d2) 

We conclude that 

2kl 2kl 

h < 5^4(dl Vd2)[(u + l)2(/ + l)]4^(dl+d2f-' 

v=kl+2 e=v 

< 4(fc/)^(di Vd2)[(2A:/ + l)2(/ + l)]4fc(di + d2)^'-^ (6.23) 

Substitute (6.19), (6.20) and ( 6.23[ ) in (6.18), and note that the bounds for I 2 and Is are negligible 
compared to the one for R, we see that 

E(p(AW)2 ^) < 12 kl{l + o{l))[ikl + l)\l + l)]2^n(di + d 2 )^'. 

Finally, given e choose k such that 2ke > 1. We can now apply Markov’s Inequality and obtain 
the desired bound on 

pwA»)>„.(4+rfy/^) < 

< I2kl{l + o(l))[(A:/ + 2f{l + l)]2V-2^" 

= = 0(1) . 


More generally, the same counting arguments and Markov Inequality can be used to show the 
following probability bound for for all m = 1 , 2 ,..., /: 

P > n^(di + da)'/^) < O ((/ - = o(l) . (6.24) 


Let us now turn our attention to the spectral radii of (recall ( 6 . 8 )). 

Denote the spectral radio of each such matrix by /^(r^*’™'^). For any positive integer k, we have: 


E(p(r('’"^)) 2 fc) < E(rr((r('’”")) 2 '=)) = ^e(]J (Mej”" 0 - 

c£T> 

The right hand side is the sum over the set P of cycles c of length 2kl each of which is obtaining 
by concatenation of 2 k paths, with each of those paths being a concatenation of two self-avoiding 
walks of length I — m, respectively, m — 1, and with non-empty intersection. The entries Me^ 
correspond to either {A — A)ei, Ae^ or Ae^ and rrii is the number of times the edge e* is traversed 
in the cycle c. 

We want to bound the number of such cycles. The same representation from [21] we used to 
count the cycles in C gives the following bound for the number of such cycles with v vertices and e 
edges: 

De,„ := v‘^%V + if {I + l)]4fc(l+e-^+l). 


(6.25) 
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Note that we have at least (m V Z — m + 1) different vertices, since there are at least two self¬ 
avoiding walks of length {m — 1), respectively {I — m), in each cycle; there are at most 2 k{l — 1) 
vertices because each length I path is the concatenation of two self-avoiding walks with non-empty 
intersection. 

Let c be one of these cycles; we need to estimate E(n 

We know that exactly 2k edges of c contributed Ae^, counting multiplicity. We can bound their 
contribution by _ What is left, for each e^, has the form A'^^{A — Here n, is the 

number of times the edge e* is weighted by Ae^ and rrii is the number of times the same edge is 
weighted by {A — A)e^. If Ui > 0, because = Ae^^ 


hence 


- A)^;) < E(^,(l - A)^;) < ^ 


If nj = 0 we can use Proposition]^ to bound the term directly. Combining these bounds with (6.25) 
and use Lemma 13 to get the following bound on E(rr((r(^’”^))^^)): 


2k(l-l) 2k{l-l) 

^ 2(2n)V^[(u+l)2(/+l)]4^(i+"-^+i) f f 

v=mW(l-m+l) e=v-l ^ ^ ^ 


di V ^2 
2n 


e—H+1 


{di V d2) 


n 


2k 


< Skl{l + o(l))n[(2A;(/ - 1) + 2 f{l + 1)^]" 


2 i 2 fc ( (^1 + ^ 2 )^ 


n 


(6.26) 


we have employed the same considerations here as in (6.19). 
Note that 

{di + d2y 


n 


< 1 , 


because of our choice of I (see Proposition . 

Given e, choose k such that 2k€ > 1 and use Markov’s inequality again to get: 


]P(p(r(«.™)) > ^ 6 ^ 


^ E(p(r('’™)) 2 ^) 

— ^ 2 ke 

< kl{l + o(l))[(2A:(/ - 1) + 2f{l + l)2]2V-2^" 


0 ( 1 ) . 


Remark 6.3. Note that in the case of each spectral bound for of for 1 < m < Z 

we showed, the probability of the spectral radius being larger than the bound decays roughly like 
^i- 2 A:e ^ large enough. Since the total number of such bounds is 0(/), so logarithmic, we can 

conclude that all of them happen simultaneously with high probability. 


Finally, we have all the tools to prove Lemma 

Proof of Lemma^ We have shown that + E, where i? is a small- 

spectral-radius perturbation (the sum of and for m = !,...,/). We will now focus our 

attention on the remaining (significant) term. 

Let Tm be the set of vertices which m-neighborhood is a tree, we have, for i £ Tm 

2n 

(5(™)e), = Y.s\f = \dB^{i)\ = (di +d2)(di + d2 - I)'"-'. 
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For i ^ Tm'- 

2 n 

= Y.s\f < 2\B^{i)\ < 2{di+d2r, 
i=i 

since the Z-tangle-freeness of the graph implies that < 2 for each i and j. 
We then have 



2 n 

< 


+ 

Y ^iidi + d 2 )idi +d 2 - 1)™ ^ 


i=l 


i^'Trn 


i&Tm 




+ 


^ Xi{di + d2){di +d2- 1 )™ ^ 
i^'Trn 


The last equality uses the fact that x'e = 0. Using Lemma and Cauchy-Schwarz’s inequality, we 
obtain that 


The proof that 


|g/^(m-l)^| < ^ri^/^{^di+d 2 Y 

\a’S^^-^'>x\ <3Y/Ydi + d2y 


(6.27) 

(6.28) 


is analogous. 

To prove the inequality of the lemma, namely that 


||5«x|| 2 <n^(di+d2)'/'(l + o(l)) , 

recall the matrix A defined in and decomposition ( |6.9[ ). We have shown (see Remark |6.3[ ) that 

max V(A«),p(r('’"^))}<n^(di+d2)'/' , 

with high probability. Then 

||AS(”"-i)x|| 2 < ^||5(™-i)a:||2 + O . 

Bound the spectral radii of by 0{{di + ^ 2 )™”^) and use ( 6.27[ ) and (6.28) to get: 

\\AS^^-^Y\\2 < 0in-\di+d2r-y + 0{n-^/^+^/\di + d2r) 

= 0{n-^/^+^/\di + d2D 
= 0{n-^/^+^/^{di+d2y) 

= o{l) , 

as {di + (^ 2 )^^^ = n^, and (i < 1/4. 

Finally, using (6.24) and putting it all together, 

i 

I| 5 «x|I < (/ + l)n^(di + d2f‘^ + ^ nAdi + d2Y-^^/^o{l) = {l + iXidi + d2)'/'(l + 0(1)) . 

m=l 
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With this, the proof of Lemma is completed. □ 
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